UNIC — Unicode Character Database
This UNIC component provides access to character properties as defined in the Unicode® Standard Annex #44 - Unicode Character Database.
UCD is a UNIC super-crate composed of smaller crates that each provide data for a specific area, so dependent crates can pull in only the data they need instead of importing all UCD data.
Crates
Here's a list of components (available or planned) for this super-crate:
- version: The Unicode Version of UCD data.
- common: Common properties, such as Alphabetic, White-Space, Control and Numeric.
- age: Age property.
- bidi: Bidirectional properties. (Hebrew, Arabic, ...)
- block: Block properties.
- case: Letter Case properties.
- category: General_Category property.
- hangul: Hangul Syllable Composition & Decomposition.
- ident: Identifier properties.
- name: Name property.
- normal: Normalization properties.
- segment: Segmentation properties.
- ea-width: East Asian Width properties.
- joining: Cursive joining properties. (Arabic, Syriac, ...)
- numeric: Other character numeric properties.
- script: Script properties.
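
To give a feel for the kind of per-character data the common component is described as covering, here is a minimal sketch. It does not use UNIC's own API, which this section does not show. It relies only on Rust's standard `char` methods as stand-ins, and the `CommonProps` struct and `common_props` function are hypothetical names introduced for illustration. Note that the standard library's `is_control` and `is_numeric` are defined in terms of General_Category classes, so they may not match the UNIC properties of the same name exactly.

```rust
/// Hypothetical container for four "common" character properties.
#[derive(Debug, PartialEq)]
struct CommonProps {
    alphabetic: bool,
    white_space: bool,
    control: bool,
    numeric: bool,
}

/// Look up the four properties using only the standard library.
/// Assumption: std's `char` methods stand in for UNIC's data here.
fn common_props(ch: char) -> CommonProps {
    CommonProps {
        alphabetic: ch.is_alphabetic(),  // Unicode Alphabetic property
        white_space: ch.is_whitespace(), // Unicode White_Space property
        control: ch.is_control(),        // General_Category Cc
        numeric: ch.is_numeric(),        // General_Category Nd, Nl, No
    }
}

fn main() {
    for ch in ['A', ' ', '\n', '7'] {
        println!("{:?} -> {:?}", ch, common_props(ch));
    }
}
```

Bundling the lookups into one struct is only a convenience for the example. The point of the per-component split described above is the opposite: a dependent crate that needs a single property would depend on just the component that provides it.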
See http://unicode.org/reports/tr44/#Property_List_Table for the complete
list of properties defined in UCD. Eventually, all these properties will be
available under unic-ucd.